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Abstract: A cold standby system of two identical units is 
analyzed stochastically under priority in repair disciplines. 
Unit has a constant failure from normal mode. There is a 
single server who visits the system immediately to carry out 
repair activities. Server conducts preventive maintenance of 
the unit after a pre-specific time't' of operation. However, 
repair of the unit is done by the server at its failure. Priority is 
given to repair of one unit over preventive maintenance of the 
other unit. The random variables associated with different 
rates are statistically independent. The failure time and time by 
which unit goes for preventive maintenance follow exponential 
distribution while the distributions of maintenance and repair 
times are taken as arbitrary with different probability density 
functions. The expressions for several reliability measures of 
the system are derived using semi-Markov process and 
regenerative point technique. Graphs are drawn to depict the 
behavior of MTSF, availability and profit function for 
particular values of various parameters and costs. 
Key words: Cold standby system / Maximum operation time / 
Preventive maintenance / Repair / Priority / and Stochastic 
Analysis. 
Introduction 

The performance of operating systems can be improved 
by providing redundancy as well as by conducting maintenance 
and repair of the failed components. And, cold standby systems 
are one of the most important structures in reliability 
engineering. Therefore, a lot of research work has been done on 
reliability modeling of cold standby systems considering 
different aspects on failure and repair policies. Gopalan and 
Naidu (1982), Cao and Wu (1989), Lam (1997) and Yadavalli et 
al. (2004) developed reliability models of a cold standby system 
with single repair facility. 

Further, continued operation and ageing of operable 
systems reduce their performance, reliability and safety. 
Therefore, preventive maintenance of such systems can be 
conducted after a pre -specific time (called maximum operation 
time) in order to slow the deterioration process. Recently, Malik 
(2013) investigated a reliability model of a computer system 
with cold standby redundancy and preventive maintenance. 
Also, sometimes it becomes necessary to give priority in repair 
disciplines of one unit over the other in order to reduce down 
time of the system. Chhillar et al. (2013) analyzed a parallel 
system with priority to repair over maintenance subject to 
random shocks. 

In view of the above practical situations in mind, here a 
reliability model for a two-unit cold standby system is developed 
considering the aspect of priority in repair disciplines. The units 
are identical in nature which may fail directly from normal 
mode. There is a single server who visits the systemimmediately 
for conducting maintenance and repair. Server conducts 



preventive maintenance of the unit after a maximum operation 
time't'. However, repair of the unit is done at its complete 
failure. Priority is given to repair of one unit over maintenance 
of the other unit. The random variables associated with failure 
time, completion of maximum operation time, preventive 
maintenance and repair times are statistically independent. The 
failure time and time by which unit goes for preventive 
maintenance follow exponential distribution while the 
distributions for maintenance and repair times are taken as 
arbitrary. Several measures of system effectiveness such as 
transition probabilities, mean sojourn times, mean time to 
system failure (MTSF), availability, busy period of the server 
due to maintenance and repair, expected number of 
maintenances and repairs of the unit and profit function are 
obtained using semi-Markov process and regenerative point 
technique. Craphs are drawn to depict the behavior of MTSF, 
availability and profit function for particular values of various 
parameters and costs. 
Methodology 

The semi-Markov and regenerative point processes are 
adopted to obtain reliability measures of the system model. A 
brief description as follows: 
Semi-Markov Process 

It is a continuous -time stochastic process in which 
transition from one state to another is governed by the transition 
probabilities of a Markov process and time interval between two 
successive transitions is a random variable whose distribution 
may depend upon the state from which transition takes place as 
well as on the state to which next transitions take place. 
Regenerative Process 

It is a stochastic process with time points at which, 
from a probabilistic point of view, the process restarts itself. 
These time points may themselves be determined by the 
evolution of the process, i.e., the process {X(t), t > 0} is a 
regenerative process if there exist time points < T < Ti < T 2 < 
... such that the post-T k process {X(T k+t ) : t > 0} has the same 
distribution as the post-T process {X(T0 + t) : t > 0} and is 
independent of the pre-T^ process {X(t) : < t < T^}. These time 
points are called regenerative points. Regenerative stochastic 
process was defined by Smith (1955) and has been crucial in the 
analysis of the complex system. 
Notations and Assumptions 



E : Set of regenerative states. 

O/Cs : The unit is operative/cold standby 

do : The constant rate by which unit undergoes 

for preventive maintenance 

X : Constant failure rate of the unit. 

f(t)/F(t) : pdf /cdf of preventive maintenance time 

g(t)/G(t) : pdf /cdf of repair time of a failed unit 

P„/WP m /PM The unit is under preventive 
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FU/Fw/FUR 
qij(t) / Qij(t) 
qij.k(t) / Qijk(t) 
mij 
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maintenance/waiting for preventive 

Maintenance / under preventive maintenance 

continuously from previous state. 

:The failed unit is under repair/waiting for 

repair/under repair continuously from previous 

state. 

:pdf and cdf of direct transition time from a 
regenerative state Sj to a regenerative state Sj 
without visiting any other regenerative state 
:pdf and cdf of first passage time from a 
regenerative state S; to a regenerative state Sj or 
to a failed state Sj visiting state k once in (0,tj. 
: The contribution to mean sojourn time in 
regenerative state Sj when system is to make 
transition in to regenerative state Sj . 
Mathematically, it can be written as 



m ij =E(T ij ) = jtd[QJt)] 



Mi 



o 

where Ty is the transition time from state 
Si to Sj; Si , Sj e E. 

: The mean Sojourn time in state Sj this is 
given by 



JUj = E(Ti) = I P{T i > t)dt = Im, , where 
; 

T i is the sojourn time in state S;. 

Wi(t) : Probability that the server is busy in the state 

Si up to time 't' without making any transition 
to any other regenerative state or returning to 
the same state via one or more regenerative 
states. 

®/© :Symbol for Laplace Stieltjes 

convolution/Laplace convolution 

~/* : Symbol for Laplace Steltjes transform (LST)/ 

Laplace transform (LT) 

The possible transition diagram of the system model is shown in 

fig. 1. 

State Transition Diagram 
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Fig 1. 

Transition Probabilities and Mean Sojourn Times 

Simple probabilistic considerations yield the 
expressions for non-zero elements 



following 



Pij 

Poi = 

Pl3 = 
Pl6 = 
P25 = 

P24-- 

P31 = 
Pu.3 



Q ij {co) = \q ij {t)dt 



a + X 



P02 



X 



a + X 
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-Ao = / («0 +/L )' 



(1) 



a + X 



X 



a + X 



X 



a + X 



[l-f(a Q + X)] 



[1 - f\a + X)] ,p 20 = g (a + X) , 



[l-g\a + X)] 



a, 



[l-g(a + X)], 

Pei =1 
[l-f(a +X)] 



a + A 

P41 = P52 
a. 



a Q + X 



Piia = 



a Q + X 



X 



'22.5 



a Q + X 



[l-g*(a +X)] 



[l-g*(a + A)] 



(2) 



It can be easily verified that 

P01 + P02 = PlO + Pl3 + Pl6 = P20 + P24 + P25= l O) 
The mean sojourn times (//, ) is the state Sj are 
1 1 



Mo = 



M2 = 



Mx = 



. Mi = 

a n + X a n + X 



[l-f(a +X)], 



a +X 



a + a 



[l-g (a + X)] 
• 1 



1 



a(a Q + a + X) 



. M2 



and jU, 

e ^ 6 e 



(4) 



Reliability and Mean Time to System Failure (MTSF) 

Let ^ ( (0be the cdf of first passage time from the 

regenerative state S; to a failed state. Regarding the failed state 
as absorbing state, we have the following recursive relations for 

4 (0 = 0oi (0 ® A (0 + Q 02 (0 ® A (0 
ri(0 = Gio(0®A(0+Gi 3 (0 + Gi6(0 « 

A(t) = Q 20 (t) ® Q (t) + Q 24 (t) + Q 25 (t) 
Taking LST of relations (5) and solving for^ (^) • We have 



R(s) = 



(6) 



The reliability of the system model can be obtained by taking 
inverse Laplace transformation of (6). The mean time to system 
failure (MTSF) is given by 
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MTSF =lim 



1-0 O (S) _jU Q +jU l p Ql + jU 2 p Q2 



(7) 



s 1 PoiPio P02P20 
Steady State Availability 

Let A^{t) be the probability that the system is in up- 
state at instant't' given that the system entered regenerative state 
Si at t=0. The recursive relations for A^it) are given as 
4(0 = M (t) + q m (t) 9 \(t) + q 02 (t) 9 A 2 (0 

4(0 = m,(o + q w (t) e a,(0 + q ll3 (t) e a,(0 + qjt) e 4(0 
a 2 (o = M 2 (t) + qjt) e 4(0 + ? 2M (o e 4(0 + q 225 (t) e 4(0 

A 6 (t) = q 6l (t)®A l (t) 

(8) 

where M t (t) is the probability that the system is up initially in 

state Sj 6 E is up at time t without visiting to any other 
regenerative state, we have 

M (t) = e - {aQ+X) \ M x {t) = e- (ao+X)t W), 

M 2 (t) = e' {a ° +A)t G(i) (9) 

Taking LT of relations (8) and solving for Aq(s) , the steady 

state availability is given 
by 

N 

Aq(co) = limsAjO) = — , where (10) 

s->o B 
N = n {(l- P 22 , M-Pu 3 )- P 16 P 6i ] 1 
+pAp m {i-p 225 )+ } + M 2 {P iP 6 i-P i( 1 -Piu)} 

D = Mo i (! - A1.3X 1 - P22.5 ) " PeiPie C 1 " P22.5 ) ) 

PoiC 1 " P22.5) + P02P21.4} + fA{PmP6iPi6 

+P02 (1 - A1.3 ) 1 + y"e { P02P21.4P16 + An (! ~ P22.5 ) J 
Busy Period Analysis for Server 

Let Bj (?) and 5 ; - (?) be the probability that the server is busy 

due to preventive maintenance and repair of the unit at an instant 
't' given that system entered state S; at t=0. 

B P (t) = q m (t)®B l p (t) + q 02 (t)®B p (t) 

B p x (t) = W x (t) + q w (t) 9 B p (t) + q n3 (t) 9 B p (t) + q l6 (t) 9 B p (t) 

B P 2 {t) = q 20 (t) 9 B p (t) + q 2Z5 (t) 9 B' 2 \t) + q 2U (t) 9 B p (t) 

B P 6 (t) = q 6l (t)®B p (t) 
and 
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tf(t) = q m (t) 9 B«{t) + q m {t) 9 B R 2 {t) 

Bf(f) = q w (t) © tf(t) + q ll3 (t) 9 B*(t) + q l6 (t) 9 B*(t) 

B R (0 = W 2 (t) + q 20 (t) 9 **(*) + q 2U (t) 9 B^t) + q 225 (t) 9 B 2 (t) 

B*(t) = W 6 (t) + q 61 (t)®B?(t) 

(11) 

where Wj(?) be the probability that the server is busy in state 

S; due to preventive maintenance and repair up to time 't' 
without making any transition to any other regenerative state or 
returning to the same via one or more non -regenerative state and 
so 

_ r -(a„ +X)t , -(«„ +X)t 



W l (t) = {e 



(a () e~ 



W 2 (t) = {e- (ao+l) ' + (a a e- (a ^ ) ' 81) + (l^ (a " +A) ' ®\)}G{t) , 



W 6 (t)=G(t) 



taking LT of relations (11). Solving for Bq P (t) nndB Q (?) , the 

time for which server is busy due to preventive maintenance and 
repair respectively are given by 

Bg(t) = hmsB* p (t) = ^, 

■r->0 D 



and 



B«(t) = \imsB* R (t)= I t 
«->-o D 

Where N[ (?) = W* (p m (1 - p 225 ) + p Q2 p 2lA ) , 

N 2 {t) = W*{p 02 (\ - p m - p l6 p 6l )} - <{p 16 ( P01P22.5 " Pm ~ hi ~ Pm)) 
and D has already mentioned. 

Expected Number of Repairs and Preventive Maintenances 
of the Unit 

R P 
Let Ri (?) and R{ (?) be the expected number of 

repairs and preventive maintenances of the unit by server in time 
interval (0,t] given that the system entered the regenerative state 

R P 

Sj at t=0. The recursive relations for tf/\?)and R[ (?) are given 
as 

< (0 - 2oi (0 ® ^ (0 + 2o2 (0 ® ^ (0 

R?{t) = Q w (t) ® R^t) + fi lu (0 ® <(0 + G 16 (0 ® <(0 

^(0 - 0»(0 ® [l + <(0] + G 22 ,(0 ® [l + <(0] + 21 ,(O ® [1 + ^, R (0] 

«0 = !2 61 (0®[l+<(0] 

and 

<(o=e 01 (o®<(o+Go2(o®/? 2 "(o 

^(0 = G 1 o(0®[l + <(0]+e 11 . 3 (0®[l+^(0]+G 16 (0®<(0 
^(0 = g 20 (0 ® <(0 + G 22 ,(0 ® ^(0 + G 21 . 4 (0 ® ^(0 
/? 6 "(0 = G 61 (0®<(0 

(13) 



(12) 
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Taking LST of relations (13) and solving for R^it) and R^it) 

. The expected number of repairs and preventive maintenances 
per unit time are respectively given by 
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K 



R?(oo) = lim sfif O) = 
.5^0 u D 



(14) 



N 



and R (oo) = lim sR (s) = , where 

s->a D 
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and L) has already defined. 

Profit Analysis 

The profit incurred to the system model in steady state can be 
obtained as 

P = K oA> ~ K l B ~ K 2 B ~ K i R ~ K 4 R ~ K 5 ( 15 > 

Ko= Revenue per unit up-time of the system 
K|= Cost per unit time for which server is busy due preventive 
maintenance 

K2= Cost per unit time for which server is busy due to repair 
K3= Cost per unit time repair 
1<4= Cost per unit time preventive maintenance 
K5= Total installation cost of the system 



Conclusion 

The trends of some important reliability measures have 
been observed graphically for the particular case g(t) = 6e dt 

and f(t) = OCe "'by taking numerical values of various 
parameters and costs as shown in figures 2, 3 and 4. It is found 
that mean time to system failure (MTSF), availability and profit 
function go on increasing with the increase of preventive 
maintenance and repair rates. However, their values decline with 
the increase of failure rate and the rate by which unit undergoes 
for preventive maintenance. The study also reveals that effect of 
the rate do is more on these measures as compared to the other 
rates. Hence, a cold standby system of two identical units in 
which priority is given to repair of one unit over preventive 
maintenance of the other unit can only be made reliable and 
profitable to use by repairing the unit in a time less than the time 
taken for preventive maintenance of the unit. The beauty of the 
present study is to make a suggestion that performance of a cold 
standby system can be improved by conducting preventive 
maintenance after a maximum operation time rather than to 
spent more money on repair after failure. The application of the 
present study can be visualized in a waterpump system. 
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